Learning discriminative spatial representation for image classification

نویسندگان

  • Gaurav Sharma
  • Frédéric Jurie
چکیده

Spatial Pyramid Representation (SPR) [7] introduces spatial layout information to the orderless bag-of-features (BoF) representation. SPR has become the standard and has been shown to perform competitively against more complex methods for incorporating spatial layout. In SPR the image is divided into regular grids. However, the grids are taken as uniform spatial partitions without any theoretical motivation. In this paper, we address this issue and propose to learn the spatial partitioning with the BoF representation. We define a space of grids where each grid is obtained by a series of recursive axis aligned splits of cells. We cast the classification problem in a maximum margin formulation with the optimization being over the weight vector and the spatial grid. In addition to experiments on two challenging public datasets (Scene-15 and Pascal VOC 2007) showing that the learnt grids consistently perform better than the SPR while being much smaller in vector length, we also introduce a new dataset of human attributes and show that the current method is well suited to the recognition of spatially localized human attributes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning

Domain adaptation is a powerful technique given a wide amount of labeled data from similar attributes in different domains. In real-world applications, there is a huge number of data but almost more of them are unlabeled. It is effective in image classification where it is expensive and time-consuming to obtain adequate label data. We propose a novel method named DALRRL, which consists of deep ...

متن کامل

Image Classification via Sparse Representation and Subspace Alignment

Image representation is a crucial problem in image processing where there exist many low-level representations of image, i.e., SIFT, HOG and so on. But there is a missing link across low-level and high-level semantic representations. In fact, traditional machine learning approaches, e.g., non-negative matrix factorization, sparse representation and principle component analysis are employed to d...

متن کامل

Joint sparse model-based discriminative K-SVD for hyperspectral image classification

Sparse representation classification (SRC) is being widely investigated on hyperspectral images (HSI). For SRC methods to achieve high classification performance, not only is the development of sparse representation models essential, the designing and learning of quality dictionaries also plays an important role. That is, a redundant dictionary with well-designated atoms is required in order to...

متن کامل

یادگیری واژه نامه برای آشکارسازی محل پلاک خودرو

Car license plate detection has been always a challenging task in the context of traffic control and traffic offenses. In this paper, the problem of license plate detection from gray scale images taken in natural conditions is addressed. Our car plate database consists of images with severe imaging conditions such as low quality images, far distanced cameras, and severe weather conditions. The ...

متن کامل

Discriminative and Compact Dictionary Design for Hyperspectral Image Classification using Learning VQ Framework Sparse representation provides an efficient description for high-dimensional Hyperspectral Imagery

Discriminative and Compact Dictionary Design for Hyperspectral Image Classification using Learning VQ Framework Report Title Sparse representation provides an efficient description for high-dimensional Hyperspectral Imagery (HSI) and also encodes discriminative information useful for classification. However, due to the large size of typical HSI images, the naive way to construct a dictionary wi...

متن کامل

Compact, Adaptive and Discriminative Spatial Pyramid for Improved Scene and Object Classification

The release of challenging datasets with a vast number of images, requires the development of efficient image representations and algorithms which are able to manipulate these largescale datasets efficiently. Nowadays the Bag-of-Words (BoW) based image representation is the most successful approach in the context of object and scene classification tasks. However, its main drawback is the absenc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011